A Non - Blocking Atomic - Multicast Service for Scalable In - memory Transaction Systems
نویسندگان
چکیده
Atomic Multicasting is central to managing replication. Our focus is on its deployment for enhancing scalability and performance of in-memory transactional systems where data replication is essential. When it is deployed as an external service, enhancement is demonstrably accomplished when no node within the service crashes. Since known atomic-multicast protocols block whenever a node is crashed or appears so, using them for service implementation risks the transaction system making no progress when blocking prevails. To eliminate this risk, our service simultaneously runs two protocols with distinct properties. The first one is the fastest when crashes are absent, but blocks until a crashed node is isolated. The second, newly-designed one never blocks but can fail to deliver messages with a probability that can be made vanishingly small in return for slower performance. The service uses slow protocol when the fast one is blocked and the switch-over is instantaneous. Extensive performance study confirms best crash-free performance, crashuninterrupted service and no delivery failure in practical settings. © 2014 Newcastle University. Printed and published by Newcastle University, Computing Science, Claremont Tower, Claremont Road, Newcastle upon Tyne, NE1 7RU, England. Bibliographical details EMERSON, R., EZHILCHELVAN, P. A Non-Blocking Atomic-Multicast Service for Scalable In-memory Transaction Systems [By] R. Emerson and P. Ezhilchelvan Newcastle upon Tyne: Newcastle University: Computing Science, 2014. (Newcastle University, Computing Science, Technical Report Series, No. CS-TR-1435)
منابع مشابه
Fault-Tolerant Distributed Systems: a Modular Approach to the Non-Blocking Atomic Commitment Problem
Agreement problems allow a set of processes to agree on a common output value. These problems are of primary importance in distributed systems and di cult to solve in presence of failures. This paper considers one of these problems whose practical interest is well known, namely the NonBlocking Atomic Commitment Problem. First, a generic protocol solving this problem is given and then instantiat...
متن کاملA Low-Cost Non-Blocking Atomic Commitment Protocol for Asynchronous Systems
In distributed transactional systems, an Atomic Commitment Protocol (ACP) is used to ensure the atomicity of distributed transactions even in the presence of failures. An ACP is said to be non-blocking if it allows correct participants to decide on the transaction despite the failure of others. Several non-blocking protocols have been proposed in the literature. However, none of these protocols...
متن کاملA Case Study of Agreement Problems in Distributed Systems: Non-Blocking Atomic Commitment
This paper considers an agreement problem whose practical interest is well known, namely the Non-Blocking Atomic Commitment Problem. First, a generic protocol solving this problem is given and then instantiations of its generic statements are provided for both synchronous and asynchronous distributed systems. These instantiations use a few basic components: timeout mechanism and reliable multic...
متن کاملA Scalable Non-blocking Multicast Scheme for Distributed DAG Scheduling
This paper presents an application-level non-blocking multicast scheme for dynamic DAG scheduling on large-scale distributedmemory systems. The multicast scheme takes into account both network topology and space requirement of routing tables to achieve scalability. Specifically, we prove that the scheme is deadlock-free and takes at most logN steps to complete. The routing table chooses appropr...
متن کاملRevisiting the Non-Blocking Atomic Commitment Problem in Distributed Systems
Agreement problems allow a set of processes to agree on a common output value. These problems are of primary importance in distributed systems and di cult to solve in presence of failures. This paper considers one of these problems whose practical interest is well known, namely the Non-Blocking Atomic Commitment Problem. First, a generic protocol solving this problem is given and then instantia...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014